Cascaded Regional Spatio-Temporal Feature-Routing Networks for Video Object Detection
نویسندگان
چکیده
منابع مشابه
Deep Spatial-Temporal Joint Feature Representation for Video Object Detection
With the development of deep neural networks, many object detection frameworks have shown great success in the fields of smart surveillance, self-driving cars, and facial recognition. However, the data sources are usually videos, and the object detection frameworks are mostly established on still images and only use the spatial information, which means that the feature consistency cannot be ens...
متن کاملSpatio-temporal Object Detection Proposals
Spatio-temporal detection of actions and events in video is a challenging problem. Besides the difficulties related to recognition, a major challenge for detection in video is the size of the search space defined by spatio-temporal tubes formed by sequences of bounding boxes along the frames. Recently methods that generate unsupervised detection proposals have proven to be very effective for ob...
متن کاملSpatio-temporal Video Parsing for Abnormality Detection
Abnormality detection in video poses particular challenges due to the infinite size of the class of all irregular objects and behaviors. Thus no (or by far not enough) abnormal training samples are available and we need to find abnormalities in test data without actually knowing what they are. Nevertheless, the prevailing concept of the field is to directly search for individual abnormal local ...
متن کاملSpatial-Temporal Memory Networks for Video Object Detection
We introduce Spatial-Temporal Memory Networks (STMN) for video object detection. At its core, we propose a novel Spatial-Temporal Memory module (STMM) as the recurrent computation unit to model long-term temporal appearance and motion dynamics. The STMM’s design enables the integration of ImageNet pre-trained backbone CNN weights for both the feature stack as well as the prediction head, which ...
متن کاملSpatio-Temporal Video Search Using the Object-Based Video Representation
Object-based video representation provides great promises for new search and editing functionalities. Feature regions iii video sequences are automatically segmented, tracked, and grouped to form the basis for content-based video search and higher levels of abstraction. We present a new system for video object segmentation and tracking using feature fusion and region grouping. We also present e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2018
ISSN: 2169-3536
DOI: 10.1109/access.2017.2787155